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Field of the Invention 

5 [0001] The present invention generally relates to the purification and/or characterization of libraries of connpounds. 
for example, combinatorial and lead generation libraries. 

Baclcground of the Invention 

10 [0002] Currently, there are many general methods for purifying synthetic compounds. These methods generally in- 
volve purifying a single target compound from multiple impurities. 

[0003] Compounds are currently being prepared in relatively large numbers in combinatorial and lead generation 
libraries. Often, compounds are synthesized in multi-well plates or multi-tube arrays, with the number of related com- 
pounds numbering in the thousands. It is difficult to purify and characterize the libraries of compounds at the rate at 

15 which they are synthesized. 

[0004] One method for purifying large numbers of compounds involves a repetitive method of chromatographing 
individual compounds. This constitutes a full cycle of synthesis, work-up, and purification for each molecule. Often, a 
large amount of time is spent developing an appropriate purification method for the compounds to be purified. 
[0005] It would be advantageous to develop efficient methods for purifying and/or characterizing libraries of com- 

20 pounds which are applicable to a wide variety of organic compounds. The present invention provides such methods. 

Summary of the Invention 

[0006] Methods for purifying and/or characterizing compounds, particularly libraries of compounds such as combi- 
25 natorial or lead generation libraries, are disclosed. Purification devices capable of being used in the method are also 

disclosed. 

[0007] The method is premised on the discovery that structurally similar compounds, such as those in combinatorial 
and lead generation libraries, often have roughly similar retention factors (Rfs) on thin layer chomatography (TLC) 
plates and retention times on high performance liquid chromatography (l-IPLC) columns. The method uses this discov- 

30 ery to determine optimum conditions for purifying a library of similar compounds. 

[0008] For a given column packing, solvent system, and flow rate, most compounds tend to elute to a certain degree 
from an analytical and/or preparative IHPLC column. Similariy, most compounds move to a certain degree on TLC 
plates. While some compounds do not move at all (Rf= 0), others move with a low Rf (for example, 0.05<Rf<0.2), a 
medium Rf (for example, 0.2<Rf<0.8), or a high Rf (for example, Rf>0.8). One can readily determine a series of three 

35 or more, preferably four or more, zones of Rfs, or, in the case of analytical IHPLCs, a series of zones of retention times, 
in which the majority of compounds in the library will elute (In the case of an analytical HPLC) or move (in the case of 
a TLC), for example, low, medium and high Rfs on a TLC plate. These zones can be correlated to preparative or semi- 
preparative methods for performing preparative or seml-preparatlve IHPLCs. 

[0009] For a given analytical IHPLC and/or TLC protocol, a set of preparative IHPLC conditions can be Identified 
40 wherein compounds in one zone can be separated from compounds in other zones. Accordingly, if all or substantially 
all of the compounds in a representative sample of a library of compounds is present In the same zone, the library can 
be purified using the same HPLC protocol, which can be readily correlated to a zone on the TLC and/or analytical HPLC. 
[0010] The methods described herein involve evaluating a representative sample of compounds from a library of 
compounds, such as a combinatorial or lead generation library, by TLC and/or analytical HPLC, to determine which 
45 zone they move on the TLC plate and/or elute off the analytical HPLC column. Once the zone is Identified, a correlated 
preparative or semi-preparative method is used to purify the library. 

[001 1] Appropriate conditions for purifying a library of compounds can be worked out by route scouting a represent- 
ative sample of the library for a given analytical HPLC column, solvent system and flow rate, and/or a given TLC backing 
and solvent system. A correlated preparative or semi-preparative HPLC method can be applied to purify the library of 
50 compounds without having to change the purification parameters, so that a single method can be applied to the entire 
library. 

[0012] A suitable sample size is typically on the order of between 2 and 5% of the library, depending on the diversity 
of the compounds in the library. This approach is referred to herein as «route scouting, » since one is scouting for an 
appropriate purification route. 

55 [001 3] Preferably, both the TLC and the HPLC are performed on the representative sample of compounds. It is also 
preferred to perform preparative or semi-preparative HPLC on a sample of compounds from the library before purifying 
the entire library. This allows one to verify that the conditions are suitable for purifying the entire library, for example, 
by determining the purity of the compounds In the representative sample. Also, one can perform a TLC on between 
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1 0 and 1 00%, preferably between 50 and 1 00% of the library and compare the TLC to that in the representative sample. 
By performing a TLC on the entire library and/or determining the purity of a representative sample of compounds, one 
can ensure that the majority of the library can be adequately purified. If the purity is not adequate on the representative 
sample, or if the TLC of the library does not sufficiently match that of the representative sample, alternative preparative 
5 HPLC conditions can be used. 

[0014] The representative sample should encompass, if possible, the most polar and least polar compounds syn- 
thesized in the library, to help ensure that the method is applicable for the entire library. 

[0015] The method described herein results In substantial time savings in the purification and characterization of 
libraries of compounds, and can provide compounds with greater than 90% purity 

10 

Detailed Description 

[0016] Compounds are often synthesized in the fonm of combinatorial libraries, which are often in the form of multi- 
well plates or multi-tube arrays. A major bottleneck in the synthesis, purification and evaluation of the compounds is 
15 determining appropriate purification conditions for the compounds. The methods described herein detail how entire 
libraries of structurally related compounds can be purified. 

[0017] The methods are premised on the discovery that structurally similar compounds, such as those in combina- 
torial and lead generation libraries, often have roughly similar retention times on thin layer chomatography (TLC) plates 
and on high performance liquid chromatography (IHPLC) columns. For a given sorbent, solvent system, and fiow rate, 
20 most compounds tend to move a certain degree on a plate. For example, some compounds do not move much at all 
(Rf near 0). Others move with a low Rf (for example, between 0.05 and 0.2), a medium Rf (for example, between 0.2 
and 0.8), or a high Rf (great than 0.8). The methods use this discovery to determine optimum conditions for purifying 
a library of similar compounds. 

[0018] An advantage of the methods is that analytical HPLC is performed, if at ail. only on a representative sample 
25 prior to performing preparative IHPLC. Using the methods described herein, the purity of all or substantially all of the 
compounds purified via preparative HPLC can be greater than 90 percent. 

Definitions 

30 [0019] As used herein, the term "preparative HPLC" and like terms is meant an HPLC system which is capable of 
producing high (500 or more) microgram, milligram, or gram sized product fractions. The term "preparative" includes 
both preparative and semi-preparative columns, but Is not intended to Include analytical columns, which provide frac- 
tions in the nanogram to low microgram range. 

[0020] As used herein, the term "mechanically actuatable" when referring to a switching valve is meant a valve whose 
35 different positions are selected by other than manual actuation, i.e., by computer selection. The actual mechanical 
actuation may be electric (i.e. a solenoid controlled valve), pneumatic (i.e. an air pressure controlled valve), hydraulic 
(a liquid pressure controlled valve), or any other equivalent means. 

[0021] As used herein, an "HPLC compatible detector" is a detector suitable for use in an HPLC system which is 
capable of providing a detectable signal upon elution of a compound peak. For example, a detector capable of gener- 
ic ating a signal when a compound elutes from the compound is an HPLC compatible detector. Where component ab- 
sorbance varies widely, it may be necessary to utilize more than one detector. A detector capable of detecting a desired 
component is not an "Incompatible" detector due to its inability to detect a non-desired peak. 
[0022] A "waste reservoir" is a destination suitable for collecting eluate that does not include the compound of interest, 
for example, the solvent used to regenerate the column between runs or the eluate driven off the column before and 
45 after the compound of interest has eluted. Suitable waste reservoirs include fiasks, bottles, or jugs. 

HPLC Devices 

[0023] Displacement chromatography (an example of which is HPLC) is based on the principle that in a sample the 
50 balance between stationary phase (SP) and mobile phase (MP) is shifted the direction of SP. Single components of a 
sample displace each other like a train and the displacing agent with the greater affinity to SP pushes this train by 
fractions out of the column. Gas chromatography, liquid chromatography and HPLC chromatography are some of the 
most well known examples of displacement chromatography 

[0024] An HPLC device typically Includes at least the following components: a column, packed with a suitable sta- 
55 tionary phase, a mobile phase, a pump for forcing the mobile phase through the column under pressure, and a detector 
for detecting the presence of compounds eluting off of the column. The devices can optionally include a means for 
providing for gradient elution, although such is not necessary using the methods described herein. 
[0025] Routine methods and apparatus for carrying out HPLC separations are well known in the art, and are de- 
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scribed, for example, in the following references: J. Chromatography, 192:222-227 (1980), J. Liquid Chromatography 
4:661-680 (1981). and J. Chromatography 249:193-198 (1982). 

[0026] Suitable stationary phases are those in which the compound of interest elutes. Preferred columns are reverse 
phase columns, which may be natural (silica gel with alkyl chains of different lengths) or a synthetic crosslinked polymer 
5 (consisting of styrene and divinylbenzene). The particle size of the stationary phase is within the range from a few ^im 
to several 100 |im. The most preferred stationary phase is a C^g column. 

[0027] Suitable detection devices include mass spectrometers and UV detectors. The methods described herein use 
both of these detectors to determine when sample should be detected. 

[0028] The methods described herein often require the use of relatively high pressures (for example, up to approx- 
10 imately 2000 psi), depending on the pump head, column size, mobile phase and sorbent particle size. These pressures 
may be in excess of normal operating conditions for standard preparative HPLC. The methods also often require using 
flow rates of mobile phase (for example, in excess of 30 mL/min) in excess of those used in normal operating conditions 
for standard preparative HPLCs. 

[0029] In some embodiments, methanol/water is used as the mobile phase. Methanol/water solvent systems are 
15 more viscous than otherwise equivalent systems employing acetonitrile/water (which are more commonly used, but 
which can be more costly). The increased viscosity tends to create a high back pressure, and can require the use of 
high pressure tubing as well as tubing with wider than normal internal diameters. Further, the use of column packings 
(sorbents) with relatively small particle sizes (for example, 5 microns or less) can cause back pressure, further man- 
dating the use of wider tubings. 

20 

Purification IVIetliod 

[0030] The method involves evaluating a representative sample of compounds from a library of structurally similar 
compounds by TLC and/or analytical HPLC to determine which zone they move on the TLC plate and/or elute off the 
25 analytical HPLC column. Once the zone is identified, a con-elated preparative or semi-preparative method is used to 
purify the library. 

[0031] One can detemiine a series of three or more zones of retention factors on a TLC plate and/or retention times 
on an analytical HPLC trace, and correlate a preparative HPLC protocol to each of the zones. The zones can be, for 
example, low, medium and high Rfs on a TLC plate, where low, medium and high are arbitrary terms which can be 
30 defined in any suitable manner such that they correspond to a correlated preparative or semi-preparative HPLC protocol 
which separates compounds in one zone from compounds in the other zones. For a given analytical HPLC and/or TLC 
protocol, a set of HPLC conditions can be identified wherein compounds in one zone can be separated from compounds 
in other zones. 

[0032] Accordingly, if a TLC or analytical HPLC shows that all or substantially all (i.e., greater than 95%) of the 
35 compounds in a library of compounds are present in the same zone, they can all be purified using one correlated HPLC 
protocol. 

[0033] As used herein, a representative sample is typically less than 10 percent of the library, more preferably, less 
than 5 percent of the library, and most preferably, between 2 and 5 percent of the library. The number of compounds 
In the sample depends on the diversity of the library (in terms of polarity, and, hence, the retention time on an HPLC 
40 column or retention factor on a TLC plate). The representative compounds should encompass, if possible, the most 
polar and least polar compounds synthesized in the library, to help ensure that the method is applicable for all com- 
pounds in the library. 

[0034] As used herein, a zone refers to a range of retention times or retention factors. For example, some compounds 
do not move much at all (Rf near 0). Others move with a low Rf (for example, between 0.06 and 0.2), a medium Rf (for 

45 example, between 0.2 and 0.8). or a high Rf (great than 0.8). Using a TLC and using these ranges of retention factors, 
three zones can be identified, low, medium and high. The zones can be correlated to preparative or semi-preparative 
HPLC conditions which purify compounds which move on the TLC plate with an Rf in the particular zone. A series of 
compounds was subjected to TLC and preparative HPLC using these representative zones, and the results are de- 
scribed in Example 1. A preferred solvent system for eluting compounds on a TLC plate Is eighty percent methanol/ 

50 20 percent water (v/v), and this is the solvent system which was used in Example 1 . 

[0035] Preferably, the purification protocol uses four or more zones. One of skill in the art can readily and subjectively 
determine an appropriate set of zones, taking into consideration a range of retention factors on a TLC plate using a 
given solvent system, or a given range of retention times on an analytical HPLC column using a given column packing 
and solvent system, and con^elate these zones to effective preparative HPLC conditions to purify alt compounds which 

55 fall into a particular zone. 

[0036] Using the methods described herein, one can quickly identify a set of conditions which applies broadly to the 
library. This results in a significantly quicker turnaround time for purifying the library when compared with traditional 
methods which involve performing an analytical HPLC on the entire library before subjecting the library to preparative 
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HPLC conditions. 

[0037] Appropriate conditions for purifying a library of compounds can be worked out by route scouting a represent- 
ative sample of the library for a given analytical HPLC column, solvent system and flow rate, and/or a given TLC backing 
and solvent system. A correlated preparative or semi-preparative HPLC method can be applied to purify the library of 
5 compounds without having to change the purification parameters, so that a single method can be applied to the entire 
library. 

[0038] Preferably, both the TLC and the HPLC are perfomned on the sample of compounds. It Is also prefen-ed to 

perform preparative or semi-preparative HPLC on a sample of compounds from the library before purifying the entire 
library. This allows one to verify that the conditions are suitable for purifying the entire library, for example, by deter- 

10 mining the purity of the compounds in the representative sample. Also, one can perform a TLC on between 10 and 
100%, preferably between 50 and 100% of the library and compare the TLC to that in the representative sample. By 
performing a TLC on the entire library and/or determining the purity of a representative sample of compounds, one 
can ensure that the majority of the library can be adequately purified. If the purity is not adequate on the representative 
sample, or if the TLC of the library does not sufficiently match that of the representative sample, alternative preparative 

IS HPLC conditions can be used. 

[0039] The method described herein results in substantial time savings in the purification and characterization of 
libraries of compounds, and can provide compounds with greater than 90% purity. 
[0040] In one embodiment, the method involves performing the following steps: 

20 a) selecting a library of compounds to be purified 

b) performing a TLC and/or an analytical HPLC on a representative sample of the library, which sample comprises 
less than ten percent of the library, 

c) determining a correlated preparative HPLC method depending on how the representative sample elutes off the 
analytical HPLC column (identifying a zone of retention times) and/or the sample moves on a TLC plate (identifying 

25 a zone of retention factors), and 

d) purifying all or substantially all of the library, 

wherein the correlated preparative HPLC method is determined based on a correlation between three or more 
zones of retention times if analytical HPLC is performed and/or three or more zones of retention factors If TLC is 

30 performed, such that if substantially all of the compounds in the representative sample fall within a particular zone, a 
correlated preparative HPLC protocol can be used to purify compounds that fall within the zone. 
[0041] Preferably, both analytical HPLC and TLC are performed in step a. The representative sample is preferably 
purified via preparative HPLC before all or substantially all of the library is purified. After the representative sample is 
purified, the purity of the compounds in the sample is preferably detenmined. This provides a fast and effective means 

35 for checking the method to ensure that it works effectively. 

[0042] After the purity is determined, one can optionally perform a TLC on between 1 0 and 1 00 percent of the library, 
preferably between 50 and 100 percent of the library, under the same or substantially the same conditions as the TLC 
performed in step a. This verifies that the purification conditions which apply to the representative sample apply also 
to the entire library. 

40 [0043] By checking the purity of the representative sample and by performing a TLC on a larger portion of the library, 
one can verify that the technique adequately purified the compounds, and that the conditions apply broadly to the entire 
library. If the desired purity is not obtained, and/or If the TLC of the representative sample and the 10 to 100 percent 
of the library show that the compounds do move into the same zone, then alternative preparative HPLC protocols need 
to be determined. In some cases, a separate protocol may be applied to a portion of the library. 

45 [0044] During the preparative HPLC of all or substantially all of the library, It Is preferred that fraction collection is 
triggered by determining the presence of the compound of interest in a sample which Is eluting off of the preparative 
HPLC column using UV and/or MS detection. 
[0045] In one embodiment, libraries of compounds are purified by 

50 a) determining the Rf or retention time of a compound or a representative sample of compounds from a library of 

compounds by TLC or analytical HPLC, 

b) correlating the Rf or retention time to a set of preparative HPLC conditions with a set of parameters which 
conrelates to the conditions used for the TLC or analytical HPLC, and 

c) performing preparative or semi-preparative HPLC on the compound or the compounds in the library, wherein 

55 

i) a portion of the eluent from the preparative or semi-preparative HPLC column Is sent to a UV detector and 
another portion of the eluent is sent to a mass spectrometer (MS); 

ii) the eluent Is discarded until UV and/or MS indicates that the compound of interest is eluting, 
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iii) the sample containing the compound of interest is characterized by l\^S. 

iv) the sample containing the compound of interest is collected; 

v) the column is washed with an appropriate solvent such that any impurities are removed from the column; 

vi) the column is re-equilibrated, and 

5 vii) the steps are repeated as necessary for each compound to be purified and/or characterized. 

[0046] The methods allow collection of UV absori3ance and MS data in the same time frame. Fraction collection by 
MS has several risks associated with it, including loss of desired compound due to miscalculation of molecular masses, 
formation of adducts, and loss of compounds that do not ionize under the conditions that are used. On the other hand, 

10 the advantage of fraction collection by MS is that only a few fractions are collected. The benefits effraction collection 
by MS are obtained and the detriments are avoided by triggering fraction collection by both UV and MS. However, the 
difficulties associated with doing so are that it is difficult to trigger fraction by UV and use a high flow gradient. To ensure 
reliable splits of flow to the MS and the fraction collection, the back pressure In the line between the make-up pump 
(an on-line dilution pump which dilutes the flow which enters the MS instrument before the second flow splitter) and 

15 the probe must be less than the back pressure in the line between the post column input and the fraction collection. 
In order to achieve the desired pressure, one can change the tubing requirements from standard HPLC devices to 
provide wider diameters to handle the Increased flow rates, and optionally use a UV/DAD (diode array detector) or 
other suitable detector to trigger fraction collection. 

[0047] The following steps can optionally be performed. Information gathered on the compounds (i.e., the UV ab- 
20 sorbance and MS information) can be stored In a relational database, preferably with other information about the 
compounds (i.e., synthesis conditions, bioassay information, yield, etc.). The compounds can be further characterized, 
for example, by ''H NMR. To more rapidly purify compounds, the HPLC can Include two or more columns, one of which 
purifies compounds while the other(s) is/are being cleaned and regenerated. This step removes the chromatographic 
equilibration downtime. 

25 

Types of Compounds which can be Purified 

[0048] Virtually any organic compound which is capable of being eluted on an HPLC column can be purified according 
to the methods described herein. Preferably, the compounds to be purified are part of a library of compounds, more 
30 preferably, a lead generation or combinatorial library of compounds. The purity capable of being obtained using the 
method is typically greater than 90%, and is preferably greater than 95%. 

[0049] The term «library» refers to at least 3, preferably from 10^ to 10^ and more preferably from 102 to 10^ com- 
pounds. Preferably, these compounds are prepared as a multiplicity of compounds in a single solution or reaction 
mixture which permits facile synthesis thereof. Each member of the library of compounds can be isolated and. optionally, 
35 characterized. 

[0050] Typically, the compounds have a core structure which can be modified at at least one position, preferably two 
or more positions, with a variety of different functional groups, In order to generate a library, for example, a combinatorial 
or lead optimization library of compounds. 

[0051] Typical core structures are linear, branched or cyclic organic compounds that include at least three cariDon 
40 atoms and at least one, and preferably at least two sites capable of undergoing a reaction to change the structure, 
usually by the addition of other molecules to the reactive site. 

[0052] Examples of families of insecticides Include 1-aryl pyrazoles, pyrroles, pyrrolidones, and nicotinic acid deriv- 
atives. However, ligand compounds which may bind to the appropriate binding site may be, for example, steroids, 
hormones, peptides, proteins, oligonucleotides, oligoribonucleotides, enzymes, and the like. 

45 [0053] Suitable core structures include, but are not limited to: peptides, proteins, oligonucleotides, oligoribonucle- 
otides, oligosaccharides, alkaloids, quinolines, isoquinolines, benzimidazoles, benzothiazoles, purines, pyrimidines, 
thiazolidines, imidazopyrazinones, oxazolopyridines, pyrroles, pyrrolidines, Imidazolldones. guinolones, amino acids, 
macrolides, penems. saccharides, xanthins, benzothiadiazlne, anthracyclines, dibenzocycloheptadienes. inositols, 
porphyrins, corrins, and carboskeletons presenting geometric solids (e.g., dodecahedrane). The core structures can 

50 be derived from naturally occurring compounds, or can include non-natural modifications (i.e., non-naturally occurring 
amino acids and nucleotides). 

[0054] Suitable modifications for the core structures include: 

1) amino acid derivatives, which Include, for example, natural and synthetic amino acid residues including all of 
55 the naturally occurring alpha amino acids, species having derivatives, variants or mimetlcs of the naturally occurring 

side chains; N-substituted glycine residues; natural and synthetic species known to functionally mimic amino acid 
residues, such as statin, bestatin, etc. 

2) nucleotide derivatives, which includes natural and synthetic nucleotides, such as adenosine, thymine, guanidlne. 
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uridine, cytosine. derivatives of tliese and variants and mimetics of the purine ring, the sugar ring, the phosphate 
linkage and combinations of some or all of these. Nucleotide probes (between 2 and 25 nucleotides) and oligonu- 
cleotides (more than 25 nucleotides) including all of the various possible structural modifications; homo and hetero- 
synthetic combinations and permutations of the naturally occurring nucleotides; derivatives and variants containing 
5 synthetic purine or pyrimidine species, or mimics of these; various sugar ring mimetics; and a wide variety of 

alternate bacicbone analogs, including but not limited to phosphodiester, phosphorothionate, phosphorodithionate, 
phosphoramidate, all^yl phosphotriester, sulfamate, 3-thioformacetal, methylene(methylimino), 3-N-carbamate, 
morpholino carbamate and peptide nucleic acid analogs. 

3) a carbohydrate derivative, which would include natural physiologically active carbohydrates; related compounds, 
10 such as glucose, galactose, sialic acids, beta -D-glucosyiamine and nojorimycin, which are both inhibitors of glu- 

cosidase; pseudo sugars, such as 5a-carba-2-D-galactopyranose, which Is l^nown to inhibit the growth of Klebsiella 
pneumonia (n = 1); synthetic carbohydrate residues and derivatives of these (n = 1) and all of the complex oligo- 
meric permutations of these as found in nature, including high mannose oligosaccharides, the Icnown antibiotic 
streptomycin (n > 1 ). 

IS 4) a naturally occurring or synthetic organic structural motif. The term "motif is defined as an organic molecule 

having or containing a specific structure that has biological activity, such as a molecule having a complementary 
structure to an enzyme active site, for example. This term includes any of the well known basic structures of 
pharmaceutical compounds including pharmacophores, or metabolites thereof. These basic structures include 
beta-lactams, such as penicillin, known to inhibit bacterial cell wall biosynthesis; dibenzazepines, known to bind 

20 to CNS receptors and used as antidepressants; polyketide macroiides, known to bind to bacterial ribosymes, etc. 

These structural motifs are generally known to have specific desirable binding properties to ligand acceptors. 

5) a reporter element, such as a natural or synthetic dye or a residue capable of photographic amplification which 
possesses reactive groups that may be synthetically incorporated into the sulfaminimide structure or reaction 
scheme, and may be attached through the groups without adversely interfering or affecting with the reporting 

25 functionality of the group. Preferred reactive groups are amino, thio, hydroxy, carboxylic acid, carboxylic acid ester, 

particularly methyl ester, acid chloride, isocyanate alkyi halides, aryl halides and oxirane groups. 

6) an organic moiety containing a polymerizabie group such as a double bond, or other functionalities capable of 
undergoing condensation polymerization or copolymerization. Suitable groups include vinyl groups, oxirane 
groups, carboxylic acids, acid chlorides, esters, amides, aziactones, lactones and lactams. Other organic moiety 

30 such as those defined for R and R* may also be used. 

7) a macromolecular component, such as a macromolecular surface or structures which may be attached to the 
sulfaminimide modules via the various reactive groups outlined above, in a manner where the binding of the at- 
tached species to a ligand-receptor molecule is not adversely affected and the interactive activity of the attached 
functionality is determined or limited by the macromolecule. Examples of macromolecular components include 

35 porous and non-porous inorganic components, such as, for example, silica, alumina, zirconia, titania and the like, 

as commonly used for various applications, such as normal and reverse phase chromatographic separations, 
water purification, pigments for paints, etc.; porous and non-porous organic macromolecular components, including 
synthetic components such as styrenedivinyl benzene beads, various methacrylate beads, PVA beads, and the 
like, commonly used for protein purification, water softening; and a variety of other applications, natural components 

40 such as native and functionalized celluloses, such as, for example, agarose and chitin, sheet and hollow fiber 

membranes made from nylon, polyether sulfone or any of the materials mentioned above. The molecular weight 
of these macromolecules may range up to about 2000 Daltons. 

[0055] Suitable chemical modifications also include chemical bonds to a suitable organic moiety, a radioactive moiety, 
45 a hydrogen atom, an organic moiety which contains a suitable electrophilic group, such as an aldehyde, ester, alkyi 
halide, ketone, nitrile, epoxide or the like; a suitable nucleophilic group, such as a hydroxyl, amino, carboxylate, amide, 
carbanion, urea or the like; or one of the other structural diversity elements defined below. In addition, the chemical 
modifications can be in the form of a ring, bi-cyclic or tri-cyclic ring system; or structure which connects to the ends of 
the repeating unit of the compound defined by the preceding formula; or may be separately connected to other moieties. 
50 [0056] The modifications can be the same or different and each may be one or more atoms of carbon, nitrogen, 
sulfur, oxygen, any other inorganic elements or combinations thereof. For example, the core structure can be modified 
with cyano, nitro, halogen, oxygen, hydroxy, alkoxy, thio, straight or branched chain alkyi, carbocyclic aryl and substi- 
tuted or heterocyclic derivatives thereof. The modifications can be in different in adjacent molecular cores and have a 
selected stereochemical arrangement about the carbon atom to which they are attached. 
55 [0057] The compounds can be laid out in a logical fashion in multi-tube arrays or multi-well plates, in the form of 
arrays of chemical compounds. Preferably, the compounds all have a central core structure, and have various modi- 
fications which permit the identification of structure-activity relationships with which to determine optimum compounds 
for a particular use. 
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[0058] The array can be ordered in such a fashion as to expedite synthesis, purification, and evaluation, to maximize 
the informational content obtained from the testing and to facilitate the rapid evaluation of that data. 
[0059] The arrays can be constructed from logically ordered and arranged sub-arrays of compounds. Sub-arrays 
can be prepared which include spatially addressable sets of structurally related individual chemical compounds, with 
5 a common structure and a variable modification of the structure. Sub-arrays are particular useful when multiple positions 
on the structure are modified, and the variation between any two compounds within a given sub-array can include, for 
example, zero (0) or one (1 ) change in a stmcture. 

[0060] These sub-arrays and arrays can be organized to form higher order arrays that include sets of arrays, and 
can be evaluated as a higher order array to provide information regarding the optimum structural features for the 

10 common core structure of interest. 

[0061] The sub-arrays can be arranged in such a manner that the direct comparisons of compounds automatically 
yields information regarding the effect known fragments have on a desired application, as well as on the effect on 
changes in physical and reactive properties. As provided by simple set theory for any number of Independently variable 
structural diversity elements n, there exists n logical higher order array arrangements, such that relational information 

15 on the effect of variation of each of the n structural diversity elements can be obtained in a similar manner by comparison 
of testing data from the relative addresses in appropriately arranged sub-arrays. 

[0062] By screening all possible synthetic variations of a core molecule, the selection of the optimal candidate is 
more a function of the data collection method than the «rational» basis for selecting the compound. The desired physical 
and chemical properties, i.e.. binding affinity and bioactivity, can be rapidly optimized, and directly correlated with the 

20 structural changes within a particular array or sub-array. 

[0063] Because the spatial address of each compound within a multi-tube rack is known, the arrays can be tested 
to generate complete relational structural information such that a positive result provides: (1 ) information on a compound 
within any given spatial address; (2) simultaneous juxtaposition of this information upon a set of systematically structural 
congeners; (3) the ability to extract relational structural information from negative results in the presence of positive 

25 results. 

[0064] Preferably, the purification Is carried out via computer control, where the location of each tube in a multi-tube 
array or each well in a multi-well plate is stored in a computer, and the identity of the compound to be synthesized Is 
stored in the computer in a «memory map» or other means for correlating the data for the compound to the position 
of the tube or well. Alternatively, the purification can be performed manually, preferably in multi-tube racks or multi-well 
30 plates, and the information stored on a computer. The compounds in the tubes can be purified and/or characterized. 
[0065] The present invention will be further understood with reference to the following non-limiting examples: 

Example 1: Analytical Methods 

35 [0066] A series of compounds were evaluated on a TLC plate with a C1 8 backing using an 80/20 v/v methanol/water 
solvent system. The compounds were found to have one of four ranges of retention factors (Rfs), Rf = 0, 0.05<Rf<0.2, 
0.2<Rf<0.8, or Rf> 0.8. 

[0067] Those compounds which did not have an Rf= 0 were eluted off of a 5 micron, 50 mm CI 8 reverse phase 
HPLC column with an internal diameter of 4.6 mm. The compounds were eluted using various gradients of two solvent 
40 systems, solvent A (water/acetonitrile 98/1.5 (v/v)) and solvent B (100% methanol). 

[0068] Those compounds with an Rf between 0.2 and 0.8 were eluted using protocol I or II as described below. Those 
compounds with an Rf between 0.05 and 0.2 were eluted using protocol II or 111 as described below. Those compounds 
with an Rf greater than 0.8 were eluted using protocol I or IV as described below. The protocols are presented below 
in tabular form. 

45 



Table 1 



Protocol 1 (analytical) 


Time (min) 


Flow rate (mLVmin) 


%A 


%B 


0.0 


1.5 


75 


25 


5.0 


1.5 


5 


95 


6.0 


1.5 


0 


100 


8.0 


1.8 


75 


25 
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Table 2 



10 



Protocol II (analytical) 


Time (min) 


Flow rate (mL/min) 


%A 


%B 


0.0 


1.5 


45 


55 


5.0 


1.5 


0 


100 


6.0 


1.5 


0 


100 


8.0 


1.8 


45 


55 


Table 3 


Protocol III (analytical) 


Time (min) 


Flow rate (mLVmin) 


%A 


%B 


0.0 


1.5 


30 


70 


5.0 


1.5 


0 


100 


6.0 


1.5 


0 


100 


8.0 


1.8 


30 


70 


Table 4 


Protocol IV (analytical) 


Time (min) 


Flow rate (mL/min) 


%A 


%B 


0.0 


1.5 


95 


5 


5.0 


1.5 


5 


95 


6.0 


1.5 


0 


100 


8.0 


1.8 


95 


5 



35 

[0069] These four protocols on an analytical column can be scaled up to the following four protocols (shown below 
in tables 5-8) on a preparative HPLC column (50 X 20 mm id with a C1 8 sorbent with a particle size of 5 microns or less). 

Table 5 



Protocol 1 (preparative) 


Time (min) 


Flow rate (mL/min) 


%A 


%B 


0.0 


28.0 


90 


10 


5.0 


28.0 


75 


25 


6.0 


30.0 


5 


95 


7.0 


25.0 


0 


100 


9.0 


25.0 


90 


10 


Table 6 


Protocol II (preparative) 


Time (min) 


Flow rate (mL/min) 


%A 


%B 


0.0 


15 


70 


30 


1.0 


28 


45 


55 
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Table 6 (continued) 



10 



35 



Protocol II (preparative) 


Time (min) 


Flow rate (mUmin) 


%A 


%B 


6.0 


28 


0 


100 


7.0 


28 


0 


100 


9.0 


30.0 


70 


30 


Table 7 


Protocol III (preparative) 


Time (min) 


Flow rate (mL/min) 


%A 


%B 


0.0 


15.0 


45 


55 


1.0 


28.0 


30 


70 


6.0 


28.0 


0 


100 


7.0 


28.0 


0 


100 


9.0 


30.0 


45 


55 


Table 8 


Protocol IV (preparative) 


Time (min) 


Flow rate (mUmin) 


%A 


%B 


0.0 


15.0 


98 


2 


1.0 


28.0 


95 


5 


6.0 


28.0 


5 


95 


7.0 


28.0 


0 


100 


9.0 


30.0 


9 


82 



[0070] In this example, the auto-injector was a Gilson 215 liquid handler with an 819 injection valve actuator. The 
fraction collector was a Gilson 215 liquid handler. The gradient pumps were two Gilson pump 306 models with 50 SC 
pump heads. The dilution pumps were Gilson 307 pumps with 1.5 SC pump heads. The equilibration pump was a 

40 programmable HPLC Gilson pump 305 model with a 50 SO pump head. The mixer was a Gilson 811 C dynamic mixer 
with an 806 Monometric model. The pressure monitor was a Gilson 806 model pressure monitor. The HPLC detectors 
used were a Gilson 170 model diode array detector with a MS detector Platform LC model from micromass. The MS 
detector contains a quadruple mass analyzer with atmospheric pressure ionization (API) source capable of using both 
atmospheric chemical ionization (APCI) and Megaflow electrospray ionization probes. The mass spectrometer is 

45 equipped with a rotary pump and a transformer The switching valves were two Gilson valve mates and a 10 port 
Rheodyne switching valve. The splitter was one 1/1000 ACURATE by LC packing and an Upchurch splitter. The data 
collection system was a Digital CELEB GL-2 computer, monitor and Hewlett Packard Laser Jet 6P printer that included 
micromass Masslynx NT 3.1 B6 work under Windows NT V. 4.0. OpenLynx Version™, FractionLynx™ and Gilson's 
Unipoint v. 1 .64 software. 

50 [0071] The automated prep-HPLC/UV/MS was assembled as shown below: 



55 
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15 




I 




1 


















IS 




20 














25 



30 



35 



40 



45 



[0072] The tubing sizes for tlie system are as follows: 

[0073J From the mixer to the injector, the injector to the column, the column to the UV diode array detector and from 
the UV to the splitter box was Green Peek, Upchurch Scientific, INC 0.03" ID. From the make-up pump to the splitter 
box was Green Peek, Upchurch Scientific. INC 0.01" ID. From the splitter box to the upchurch splitter was Green Peek, 
Upchurch Scientific, INC 0.007" ID. From the Upchurch splitter to the waste reservoir was Green Peek, Upchurch 
Scientific. INC 0.01" ID. From the splitter to the fraction collector was 0.04" ID (Tubing Accurate/FC cat. No. PE-1000 
FC). 

[0074] The tubing sizes permitted the use of a 30 mUmin flow rate and a relatively small sorbent particle size (5 
microns or less) with the purification method described below. With the smaller tubing sizes typically found in conven- 
tional HPLCs. it would be extremely difficult to achieve this high of a flow rate, and, accordingly, it would be extremely 
difficult to perform the purification method. 

[0075] The diagram above illustrates the path of sample isolation during an HPLC run. After injection, the solvents 
from the gradient pump separate the components in column A while column B is being re-equilibrated by the reequili- 
bration pump. A ten port Rheodyne switching valve controls this action. The separated components enter into the UV 
diode array detector and then go to the splitter box. At the splitter box, the flow splits in a 1/1000 ratio. Only one part 
goes to the MS and the remainder goes to the waste reservoir after the compound of interest is detected by UV or MS, 
and the fraction collection is triggered and the sample collected. The flow that goes to the MS is diluted by a dilution 
pump with a flow of 1.5 mUmin for the electrospray (ESP) probe. Then, the diluted flow is split again by a second 
splitter (an Upchurch splitter) and only approximately 300 \iL enters the ESP probe and the remainder is sent to waste. 
A second splitter Is not required for the atmospheric pressure chemical ionization (APCI) probe. The gradient pumps 
were controlled using Unipoint software. 

[0076] The delay time when the MS is seeing a peak and when the peak reaches the fraction collector, as well as 
the delay time between the UV detector and the MS were measured using coumarin as a standard, using a methanol/ 
water (50/50) mobile phase, and are shown below in Table 9. 



Table 9 



Probe 


Gradient flow rate (mU 
min) 


Make-up flow rate (miy 
min) 


Time delay (second) 
MS-FC 


Time delay (second) 
UV-MS 


ESP 


22 


1.5 


20 


3.5 




28 


1.5 


15 


9 




30 


1.5 


15 


9 












APCI 


22 


0.5 


12 


23 
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Table 9 (continued) 



Probe 


oraoieni now raie (itil/ 
min) 


MaKe-up now raie [rnij 
min) 


lime aeiay (secona^ 
MS-FC 


1 ime aeiay ^secona ) 
UV-MS 




28 


0.5 


15 


12 




30 


0.5 


15 


12 



[0077] The mass spectrometer was calibrated with PEG 200 to PEG 1000 for mass calibration prior to delay time 
10 measurement. The mass spectrometer was tuned with two probes at different flow rates. 

[0078] A variety of compounds were passed through the HPLC column using the methods described herein. The 
compounds had molecular weights ranging from 100-650 g/mol, with sample sizes ranging from 0.1 to 200 mg, and 

injection volumes ranging from 50 to 2000 microliters. 

[0079] The following example illustrates the use of the high throughput method for purification. 

. 15 

Example 2: Purification of a Small Library of Compounds 



[0080] A proprietary library of compounds was selected, and a TLC of 5 of the compounds was obtained, using 80/20 
methanoi/water as eluent. The TLC Is shown below. 



20 



25 



30 



35 




TLC of proprietaiy compounds 



40 



[0081] The TLC Rfs were calculated as being approximately 0.11 and correlated to pre-defined analytical HPLC 
methods (methods II or III in Example 1). Methods II and III were separately evaluated to determine which provided 
the best separations. The chromatograms (shown below) showed that method II was better at purifying the compound 
45 from the impurities. 

[0082] The compounds of interest were purified on a preparative HPLC column, with an injection volume of 2 mL. 
and a concentration of 150 mg compound/ 2 mL DMF. The compounds had retention times of between three and three 
and a half minutes on the preparative HPLC column. 



50 



55 
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[0083] Following the purification of the representative sample of compounds, a TLC (shown below) was done on the 
entire library. The TLC shows that all of the compounds of interest elute with approximately the same Rf (about 0.1 ). 



I 




[0084] The preparative HPLC method was applied to all 450 compounds in the library, and an average purity of 91 % 
was obtained, with an average yield of 28 mg. More than 85% of the compounds had a purity greater than 95%. 
[0085] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, 
many equivalents to the specific embodiments of the invention described herein. Such equivalents are intended to be 
encompassed by the following claims. 



Claims 

1. A method for purifying and/or characterizing one or more organic compounds, comprising the steps of: 

a) selecting a library of compounds to be purified 

b) performing a TLC and/or an analytical HPLC on a representative sample of the library, which sample com- 
prises less than ten percent of the library, 

c) determining a correlated preparative HPLC method depending on how the representative sample elutes off 
the analytical HPLC column and/or the sample moves on a TLC plate, and 

d) purifying all or substantially all of the library, 
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wherein the correlated preparative HPLC method is determined based on a correlation between three or 
more zones of retention times if analytical HPLC Is performed and/or three or more zones of retention factors if 
TLC is performed, such that if substantially all of the compounds in the representative sample fall within a particular 
zone, a con-elated preparative IHPLC protocol can be used to purify compounds that fall within the zone. 

5 

2. The method according to claim 1, wherein both analytical HPLC and TLC are performed in step a. 

3. The method according to claim 1 , wherein the representative sample is purified via preparative HPLC before all 
or substantially all of the library is purified. 

10 

4. The method according to claim 3, wherein after the representative sample is purified, the purity of the compounds 
in the sample is determined. 

5. The method according to claim 4, wherein after the purity is determined, a TLC is performed on between 10 and 
15 100 percent of the library under the same or substantially the same conditions as the TLC performed in step a. 

6. The method according to claim 5, wherein the TLC of the representative sample and the 10 to 100 percent of the 
library are compared. 

20 7. The method according to claim 5, wherein between 50 and 100 percent of the library is analyzed via TLC. 

8. The method according to claim 6, wherein if the TLC of the representative sample and the 10 to 100 percent of 
the library show that the compounds move into the same zone, then the entire library is purified via preparative 
HPLC. 

25 

9. The method according to claim 6. wherein if the TLC of the representative sample and the 10 to 100 percent of 
the library show that the compounds do not move into the same zone, then the entire library is purified via prepar- 
ative HPLC using an alternative method. 

30 10. The method according to claim 1 , wherein the representative sample is between 2 and 5 percent of the entire library. 

1 1 . The method of claim 1 , wherein during the preparative HPLC of all or substantially all of the library, fraction collection 
is triggered by detemiining the presence of the compound of interest in a sample which is eluting off of the pre- 
parative HPLC column using UV and/or MS detection. 

35 

12. The method of claim 1, wherein the purity of all or substantially all of the compounds purified via preparative HPLC 
is greater than 90 percent. 

13. The method of claim 12, wherein analytical HPLC is performed, if at all. only on a representative sample prior to 
40 performing preparative HPLC. 
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